An Integrated QAP-Based Approach to Visualize Patterns of Gene Expression Similarity
Abstract
This paper illustrates how the Quadratic Assignment Problem (QAP) can serve as a mathematical model for visualizing microarray data, based on the relationships between the objects (genes or samples). The visualization method can also incorporate the result of a clustering algorithm to facilitate data analysis. Specifically, we show its integration with a graph-based clustering algorithm that outperforms two benchmark methods, k-means and self-organizing maps. Although the application uses gene expression data, the method is general and requires only a similarity function defined between pairs of objects. The microarray dataset comes from the budding yeast (S. cerevisiae) and comprises 2,467 genes and 79 samples taken from different experiments. The proposed method delivers an automatically generated visualization of the microarray dataset that integrates the relationships derived from similarity measures, a clustering result and a graph structure.
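To make the QAP formulation concrete, here is a minimal sketch that is not the authors' implementation. It assumes the objects are placed in slots along a line, that similarity is a rescaled Pearson correlation, and that the layout is found by exhaustive search. The toy profiles and all function names are hypothetical. The objective multiplies each pairwise similarity (the QAP "flow") by the separation between the assigned slots (the QAP "distance"), so similar profiles end up close together.

```python
from itertools import permutations
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def similarity_matrix(profiles):
    """Rescale correlations from [-1, 1] to weights in [0, 1] (an assumption)."""
    return [[(1 + pearson(p, q)) / 2 for q in profiles] for p in profiles]


def qap_cost(order, sim):
    """QAP objective: similarity (flow) times slot separation (distance)."""
    n = len(order)
    return sum(
        sim[order[p]][order[q]] * (q - p)
        for p in range(n)
        for q in range(p + 1, n)
    )


def best_order(sim):
    """Exhaustive search over all layouts; feasible only for a few objects."""
    return min(permutations(range(len(sim))), key=lambda o: qap_cost(o, sim))


# Hypothetical toy profiles, not taken from the yeast dataset.
# Objects 0 and 2 rise together; objects 1 and 3 fall together.
profiles = [
    [1.0, 2.0, 3.0, 4.0],
    [4.0, 3.0, 2.0, 1.0],
    [2.0, 4.0, 6.0, 8.5],
    [8.0, 6.5, 4.0, 2.0],
]
sim = similarity_matrix(profiles)
order = best_order(sim)  # order[p] is the object placed in slot p
slot = {obj: p for p, obj in enumerate(order)}
print(order)
```

Exhaustive search is used here only to keep the sketch short. The QAP is NP-hard, so a dataset with thousands of genes would need a heuristic solver instead.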
Related articles
Interactive Visualization and Analysis for Gene Expression Data
New technology such as DNA microarray can be used to produce the expression levels of thousands of genes simultaneously. The raw microarray data are images which can be transformed into gene expression matrices where usually the rows represent genes, the columns represent various samples, and the number in each cell characterizes the expression level of the particular gene in a particular sampl...
O-30: Comparing Expression Patterns of Endometrial Genes in Implantation Failures and Recurrent Miscarriages with Fertile Couples Following ICSI/IVF Using in Silico Analysis
Background: To screen and diagnose patients with recurrent abortions and implantation failure after IVF/ICSI, differentially expressed genes of endometrium through DNA microarrays were monitored. Materials and Methods: Microarray expression profile of GSE26787 dataset from GEO database was used to analyze gene expression profiles of 15 endometrial biopsy samples- five from control fertile (CF) ...
A new approach for data visualization problem
Data visualization is the process of transforming data, information, and knowledge into visual form, making use of humans’ natural visual capabilities. It reveals relationships in data sets that are not evident from the raw data, using mathematical techniques to reduce the number of dimensions in the data set while preserving the relevant inherent properties. In this paper, we formulated d...
Identification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Network Analysis
Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...
Using a Kernel-Based Approach to Visualize Integrated Chronic Fatigue Syndrome Datasets
We describe the use of a kernel–based approach using the Laplacian matrix to visualize an integrated Chronic Fatigue Syndrome dataset comprising symptom and fatigue questionnaire and patient classification data, complete blood evaluation data and patient gene expression profiles. We present visualizations of the individual and integrated datasets with the linear and Gaussian kernel functions. A...